03 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: Hanna, Michael C. 

Kirkness, Ewen F. 

(ii) TITLE OF INVENTION: GABA A Receptor Epsilon Subunits 

(iii) NUMBER OF SEQUENCES: 4 6 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Sterne, Kessler, Goldstein & Fox P.L.L.C. 

(B) STREET: 1100 New York Avenue, NW, Suite 600 
{C) CITY: Washington 

(D) STATE: DC 

(E) COUNTRY: USA 

(F) ZIP: 20005-3934 

(v) COMPUTER READABLE FORM: 
P (A) MEDIUM TYPE: Floppy disk 

yp (B) COMPUTER: IBM PC compatible 

£*i (C) OPERATING SYSTEM: PC-DOS /MS-DOS 

|"7i (D) SOFTWARE: Patentln Release #1.0, Version #1.30 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To be assigned 
Ul (B) FILING DATE: Herewith 

m (C) CLASSIFICATION: 



L 3 (vii) PRIOR APPLICATION DATA: 

!fj (A) APPLICATION NUMBER: US 08/888, 012 

(B) FILING DATE: 03-JUL-1997 
Ft! (C) CLASSIFICATION: 



(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Steffe, Eric K. 

(B) REGISTRATION NUMBER: 36,688 

(C) REFERENCE/DOCKET NUMBER: 1488 . 0950001/EKS/SGW 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (202) 371-2600 

(B) TELEFAX: (202) 371-2540 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 614 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3872.. 4597 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GTCTTATAAT TGCTAAGCAC TTACAACTGT TTGCAGAGGA AACTGAGACT TTGTAACTAT 60 

GTCTCAGTCT CATCTGCAAA GAAGTAAGTG CTTTGCCAAG CTCCTTGAGA GGTTAGGTAA 120 

GTAGATAAAG TTCTGCTGCT GTCGGAATGT GCAGCTGGCT TTTTCATGCA GACCCTTCAG 180 

TTTCGAGGTT ACAACTCTGA CCTCTTTGGA TGACTTTGGG GAATGGAGCT CGTGTGAGTT 24 0 

CTCCATACCC AGAACCAATC CAGTCTGGTT GAATGGGAAG CAAAGTCCAT TGTAGTGGGA 300 

GGTGGAGGCT AGAGTTCTAA TGTCAGCTAG TTTAAGGCTG GGAAAGTCTG GAGGAAGTTA 360 

CAGCAGCTAC ACTGGCTGCT GCATTGACAT TTATCTTAAA GGAACAAGTC T G AAAAGC AC 4 20 

AGATTCTTAT CAAAGGCTTC ATGGTGGATT C C AC AT AG AC ATAGTGGCCA CTGGTTTTCT 4 80 

GACCTTTTCT CTGACAAAGA CTAAAGGGGA AGGTCCTGGG TATCTTACAC TTCAGCTCCC 54 0- 

AATTAGATGT GAGCACCTTC ACTTATGTTC CTAGGTGACC TGAATGAGGA GCCAAGGGAC 600 

JF\ CTCCCCAGGG TAGCTCCCAG AGCAACCCTG GAAACACTCT TCACACATCC TGACCAAGTT 660 



CAGGGCAGTG AAGGCACTGC CCTCATCGTT TCCAGAATGT GGATGGAGCC AGTCACCCAA 720 
CCAGCCATTT GTCGTGAGAG GCATCTTGTT CTGCTACCAT GTGACTAGGC AG AAAAT C T G 7 80 



Li! CTTTTGTTTC AT T TAT T GAG TCAGTCTCTG GAT GAG GG AA AGCTCATGCT CATGTGGCTA 840 

nl 

GAGCTTTGCT TGCACAGTAT TAGGCAGGGG CAGAGGGCTG GGCTACCTTA AAAATACTTG 900 

CCCTTTTTCT TGGGGACTCT GGGGAAGCGG TTTTACTACC TTTGACTTGG GAGCCTTGCT 960 

nJ 

rti CTTCTGCCAG CTAACCATGG GCCTGCCTCT TGGTTTTCTG CACCTCAGCT TTTCCCGGAT 1020 

m 

if [ AGGTGGGGAC C CAT CAT C AA AAG T G AC AG A GAAGATAAGG CCCAGGGGCT TTCAAGTCAC 1080 

UJ TAGTGGTTCC GTTTAGTAGA TGATTGTGCA TTGTTTCAAA ATGGTGCCCT AGTGACTACA 1140 

AAGCCCCAGA GCCAGCATCA TCATCAAAGC AATGACAGTA GGTAAGCACC AGACCTCCTT 1200 

GGGAGTGAGG AGGATTCTTG AGGAGAAAAG AGGTCTTCTT TCTCCTCTGC TGGAGACTAG 12 60 

TTGATCTGGA GACGTGGTTC CTTCAATGTC AGAGTTATCT TTGGGACTGG TCTCAAACTC 1320 

TTCCAGTTGG GCCCTGGGGC AGGTCTCTCC ATCTGGAGCA TACTTACGTG CTCGGCGATT 1380 

AAGGGTTCAG AATGCAGTGG TAGCCTGCTA CTCTGGCCAT CTTGGACCTT GATCCAGAGA 14 4 0 

ATCTCTGCTT CAGGAGCTTC T AAG AG AG T C CAGCCCTGCC TCCAGAGAGA GGCTTGCCCT 1500 

TCACTGATGG CTGTGGAGCC TCTGATGGAA TATTATTGCT GGTCAGGAAT TCACTGTCTT 1560 

ACAAGGAGGT TTCCTTCTTC TCTAGACAGT TCTGTTCATC AAAAAACTCT CCCTGTTCTT 1620 

CTGAAATTGG AGTCTCTGGA AGTTCCACAC ATTAAGCTTA GTTCTTTTTC CTTGGAACTG 1680 

TCCAGGTTAC ATTAGTCCAG CCACTGTTTC ACAGGACCGA GATTAAACGA TCAACATCAT 174 0 

CATTCCCGGC AT GGATC AT A GTCTGTTGTA GTCTACATAG CCCTAGTTTA TTTTTCTTCC 18 00 




-104- 

CTTATTCTTC AAAGCTTTGG GTCCATTCAT TCTTCTAGTC CCAGTCCTCT GGACATGGTC 18 60 

TATTTAATTG TGTCCCTCTG ACACTGCAGT GACCAACCAT GATCTGGTCA AAGAGGATAA 1920 

GAGTTTGAGC AG AAAAC CAT CTTTAGCATA TATTTTTTTG CTTTGGTTCA TCAGCCCCAG 1980 

ATATATTGTT TTCCTTACCC GTGCTTCTCT CACTCCTCAA GAAGAAGAAA GTGTGTGTTA 204 0 

GCATCTTTCT CTTGTCCTTC AAGACAAATT GGCATCTCTT GACGAGCGGA GAAGGTTCTT 2100 

TTTTGGCCAG AATAAATAAA ATTAAAATAG AATCATCCAA CAGAATAATA AATCTTCGTG 2160 

CAACAAGAAT ATATTATATA AACCCAGCAA TTTTGCAGGG CCTGGGTATA AC T AAT TAG A 2220 

AGTGTCTTAA ATTGCAGTCA AGATCCCACG GCAAGAGGAC TTTTGATAAA TACATTCTGG 2280 

CCAGTAGGCA AGTGCGAGGG TGGTCCGTGC AGCAGCTCTG GAGGAGTTCT ATCCCAAAGC 234 0 

TATACTCAAC ACACAGGTTT CCCACTGACA ACAGGTCGCT CCCTTGCCTT CTTCCAGAAG 24 00 

AATCTGAGAA GCTTTGCTCC TTGAGTTTCA GTGCTGCCAA GGTGAGTACG AAAGGCTGCT 24 60 

CTTCTCATTC AGCTCCAGCC CACCCAGACC TGCTGGGCAG TTGATCCACT TTCCAAAATA 2520 

pi 

70* GGAGGACACA CGGACAGGTT AGTGTTCTGG TCTGCTTTAC AAAGCTGTTG CCTGACAGGA 2580 

2 GCAAGAGTTG CTGAGTGTCT GCTGGGTTCC AGGCTGTTCT GAGCTTGGAT GGGCAGGGGC 2 64 0 

w 

U3 TAAGCCACAG GGCCTGCATG AGCCCTGCCT TGAAGGGACT TAAAAGACGA C C T AAT TATA 2700 

ru 

GGCCTAGGAA TTTTACAGTA TTGCAACTGC AATGTGATGC TGAAAGTGGA AAATGATGTC 27 60 

CTGGGCTCAG AGAAAAGCCC ACACCAGCCT GGGAGTCATG ATAGCAGCAG AGTGCTTGGG 2820 

5 3i 

S ~- 

fU GAGGGTGTGT CAGAGCATAA AGCAGCATGA AT G C T AC AAA AGAAGATGCC AACTAGAGAT 2880 

y ; 

yp ATAGGTTGTC ATCAGGTCCC G GAG GAG CCA TGACCGTCTA GCTGAGAGCC ATGACCAAGG 2 94 0 

^ ? AC AC AAT G T C CAAGTGACTG TGAGGACCTC AGTCTGCCCT GTGGATGTGT AT G C C AC AG A 3000 

CCTGACTTCT GGAGGGCTGA CTGAAATGTT CATTTTAAGC TTTTTCTTCT CTTTCCCTGA 3060 

AACACTCAGT TTGGGTTAGG GGTCATAGAC TAAGACCAAA GAGTCCAGGG T TAG AAT C T T 3120 

GGTGTAAAAT TGCAGGCCAT CTCAGGAAAT CTGTGAGCAG ATGGGATTGG CTTTGGGTAA 3180 

GGTGCGTGTG GAAAATGTCA GTGGGAGCCG GGTCATGGTG GGCCTTTAGC AT C AG AT T C C 324 0 

AGAGTGCAGA TAGTCTGTAT AGCTCATGTG AAACAGGGAG CCACCAAAAC TTTGGGGAGC 3300 

AGGCTAGTGC CGGTTTTGAC CACCTGTGGA GCAGTGCTCA CTCACGAAGG CATTTTGCCA 3360 

TCACATGAAT GTGCAGAAAG GAGGCCAAAA GCATTCTGTG CTTCTCCACC ACAGCACAGA 34 20 

CTTCCCTAGT CTCATTTGCT GAG AG TAG AC ATTCTGAGGG CCAGCAGTGC AGGTGTGATG 34 80 

TGCCTCAGAG GGTATGAAGC CCTTAGTCAG CCATCTGGAT ATCAGCTGCG TGGGCATGAT 354 0 

ATCTAGAAGG CTAATTGATT TTTTCACTTT CACCTGACTC TCTTGCCAAC CTGCAGAGAC 3600 

AGACATTGGG TGTAGGACAG TGAACTGAGA AGGAAGCTAT TAAGATTCTG GCCTTGGCTT 3660 
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AGCTCTCAAC TGGCCATTGG TCTTGCAGTA AGTCTTTTTT CTGGGCTTCT TCTGGTCCTA 3720 

TTTGTATGTA TTGCATTGTC ACATCATGCC TCTATCCTAG GGAATACTGT GAGCTGAAAA 3780 

ATGAGACCCT TACTGTTCAC GTCCTGCTAA GGGGGACCGT CGTGTCAGCA CTGTAATGCA 384 0 

GTGATGTTTT TTGTGTCTTT CAGGTGACTT C ATG GTC ATG ACG ATT TTC TTC 38 92 

Met Val Met Thr lie Phe Phe 

1 5, 

AAT GTG AGC AGG CGG TTT GGC TAT GTT GCC TTT CAA AAC TAT GTC CCT 394 0 

Asn Val Ser Arg Arg Phe Gly Tyr Val Ala Phe Gin Asn Tyr Val Pro 
10 15 20 

TCT TCC GTG ACC ACG ATG CTC TCC TGG GTT TCC TTT TGG ATC AAG ACA 398 8 

Ser Ser Val Thr Thr Met Leu Ser Trp Val Ser Phe Trp lie Lys Thr 
25 30 35 

GAG TCT GCT CCA GCC CGG ACC TCT CTA GGG ATC ACC TCT GTT CTG ACC 4 036 

Glu Ser Ala Pro Ala Arg Thr Ser Leu Gly lie Thr Ser Val Leu Thr 
40 45 50 55 

ATG ACC ACG TTG GGC ACC TTT TCT CGT AAG AAT TTC CCG CGT GTC TCC 4 08 4 

Met Thr Thr Leu Gly Thr Phe Ser Arg Lys Asn Phe Pro Arg Val Ser 
60 65 70 



W TAT ATC ACA GCC TTG GAT TTC TAT ATC GCC ATC TGC TTC GTC TTC TGC 4132 

%£\ Tyr lie Thr Ala Leu Asp Phe Tyr lie Ala lie Cys Phe Val Phe Cys 

hi 75 80 85 

III 

TTC TGC GCT CTG TTG GAG TTT GCT GTG CTC AAC TTC CTG ATC TAC AAC 4180 

Lri Phe Cys Ala Leu Leu Glu Phe Ala Val Leu Asn Phe Leu lie Tyr Asn 

W 90 95 100 

ru 

TU CAG ACA AAA GCC CAT GCT TCT CCT AAA CTC CGC CAT CCT CGT ATC AAT 4 22 8 

Of* Gin Thr Lys Ala His Ala Ser Pro Lys Leu Arg His Pro Arg lie Asn 

yg 105 110 115 

AGC CGT GCC CAT GCC CGT ACC CGT GCA CGT TCC CGA GCC TGT GCC CGC 4 27 6 

Ser Arg Ala His Ala Arg Thr Arg Ala Arg Ser Arg Ala Cys Ala Arg 

120 125 130 135 

CAA CAT CAG GAA GCT TTT GTG TGC CAG ATT GTC ACC ACT GAG GGA AGT 4 324 

Gin His Gin Glu Ala Phe Val Cys Gin lie Val Thr Thr Glu Gly Ser 

140 145 150 

GAT GGA GAG GAG CGC CCG TCT TGC TCA GCC CAG CAG CCC CCT AGC CCA 4 37 2 

Asp Gly Glu Glu Arg Pro Ser Cys Ser Ala Gin Gin Pro Pro Ser Pro 

155 160 165 

GGT AGC CCT GAG GGT CCC CGC AGC CTC TGC TCC AAG CTG GCC TGC TGT 4 4 20 

Gly Ser Pro Glu Gly Pro Arg Ser Leu Cys Ser Lys Leu Ala Cys Cys 

170 175 180 

GAG TGG TGC AAG CGT TTT AAG AAG TAC TTC TGC ATG GTC CCC GAT TGT 4 4 68 

Glu Trp Cys Lys Arg Phe Lys Lys Tyr Phe Cys Met Val Pro Asp Cys 

185 190 195 

GAG GGC AGT ACC TGG CAG CAG GGC CGC CTC TGC ATC CAT GTC TAC CGC 4 516 

Glu Gly Ser Thr Trp Gin Gin Gly Arg Leu Cys lie His Val Tyr Arg 

200 205 210 215 
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CTG GAT AAC TAC TCG AGA GTT GTT TTC CCA GTG ACT TTC TTC TTC TTC 4 564 
Leu Asp Asn Tyr Ser Arg Val Val Phe Pro Val Thr Phe Phe Phe Phe 
220 225 230 

AAT GTG CTC TAC TGG CTT GTT TGC CTT AAC TTG TAGGTACCAG CTGGTACCCT 4 617 
Asn Val Leu Tyr Trp Leu Val Cys Leu Asn Leu 
235 240 

GTGGGGCAAC CTCTCCAGTT CCCCAGGAGG TCCAAGCCCC TTGCCAAGGG AGTTGGGGGA 4 677 

AAGCAGCAGC AGCAGCAGGA GCG AC TAG AG TTTTTCCTGC CCCATTCCCC AAACAGAAGC 4 737 

TTGCAGAGGG TTTGTCTTTG CTGCCCCTCT CCCCTACCTG GCCCATTCAC TGAGTCTTCT 4 7 97 

CAGCAGACCA TTTCAAATTA TTAATAAATG GGCCACCTCC CTCTTCTTCA AGGAGCATCC 4 857 

GTGATGCTCA GTGTTCAAAA CCACAGCCAC TTAGTGATCA GCTCCCTAAA ACCATGCCTA 4 917 

AGTACAGGCG GATTAGCTAT CTTCCAACAA TGCTGACCAC CAGACAATTA CTGCATTTTT 4 977 

CCAGAAGCCC ACTATTGCCT TTGTAGTGCT TTCGGCCCAG TTCTGGCCTC AGCCTCAAAG 5037 

0 

JF\ TGCACCGACT AGTTGCTTGC CTATACCTGG CACCTCATTA AGATGCTGGG CAGCAGTATA 5097 

H ACAGGAGGAA GAGATCCCTC TCCTTTGGTC AGATTATTAT GTTCTCAGTT CTCTCTCCCT 5157 

O GCTACCCCTT TCTCTGCAGA TAGATAGACA CTGGCATTAT CCCTTTAGGA AGAGGGGGGG 5217 

63 

U\ GCAGCAAGAG AGCCTATTTG GGACAGCATT CCTCTCTCTC TGCTGCTGTG ACATCTCCCT 5277 

m 

CTCCTTGCTG GCTCCATCTT TCGTCTGCAC TAC C AAT T C A ATGCCCTTCA TCCAATGGGT 5337 

^ ATCTATTTTT GTGTGTGATT ATAGTAACTA CTCCCTGCTT TATATGCCAC CCTCTTCCTT 5397 

nJ 

ftj CTCTTTGACC CCTGTGACTC TTTCTGTAAC TTTCCCAGTG ACTTCCCCTA GCCCTGACCC 54 57 

EH 

.fj AGGCACTAGG CCTTGGTGAC TTCCTGGGGC CAAGAAACTA AGGAAACTCG GCTTTGCAAC 5517 

^ AGGCATTACT CGCCATTGAT TGGTGCCCAC CCAGGGCACA CTGTCGGAGT TCTATCACTT 5577 

GCTTGACCCC TGGACCCATA AAC C AG T C C A CTGTTATACC CGGGGCACTC T AAC CAT C AC 5637 

AAT C AAT C AA TCAAATTCCC TTAAATTTGT ATGGCACTGG AACTTTGGCA AAGCACTTTT 5697 

GACAAGTTGT GTCTGATTGG AGCTTCATGA TAGCCTTGTG ACATCTTTAG GGCAGGATTC 5757 

TTATCCCCAT TTTGCAGATG AAAACCCTGA GTCACAGATT TCTGTGGGAC TGTGGATCTC 5817 

ACTGGAAGCT ATCCAAGAGC CCACTGTCAC CTTCTAGACC ACATGATAGG GCTAGACAGC 587 7 

TCAGTTCACC ATGATTCTCT TCTGTCACCT CTGCTGGCAC ACCAGTGGCA AGGCCCAGAA 5937 

TGGCGACCTC TCTTTAGCTC AATTTCTGGG CCTGAGGTGC TCAGACTGCC CCCAAGATCA 5997 

AATCTCTCCT GGCTGTAGTA AC CC AG TGG A ATGAATTTGG ACATGCCCCA ATGCTTCTAT 6057 

ATGCTAAGTG AAATCTGTGT CTGTAATTTG TTGGGGGGTG GATAGGGTGG GGTCTCCATC 6117 

TACTTTTTGT CACCATCATC TGAAATGGG 614 6 



(2) INFORMATION FOR SEQ ID NO : 2 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 2 : 

Met Val Met Thr lie Phe Phe Asn Val Ser Arg Arg Phe Gly Tyr Val 
15 10 15 

Ala Phe Gin Asn Tyr Val Pro Ser Ser Val Thr Thr Met Leu Ser Trp 
20 25 30 

Val Ser Phe Trp lie Lys Thr Glu Ser Ala Pro Ala Arg Thr Ser Leu 
35 40 45 

Gly lie Thr Ser Val Leu Thr Met Thr Thr Leu Gly Thr Phe Ser Arg 
50 55 60 

3 Lys Asn Phe Pro Arg Val Ser Tyr lie Thr Ala Leu Asp Phe Tyr lie 

f\ 65 70 75 80 



ffl 



ri 



ru 
en 



Ala lie Cys Phe Val Phe Cys Phe Cys Ala Leu Leu Glu Phe Ala Val 
85 90 95 

Leu Asn Phe Leu lie Tyr Asn Gin Thr Lys Ala His Ala Ser Pro Lys 
100 105 110 

Leu Arg His Pro Arg lie Asn Ser Arg Ala His Ala Arg Thr Arg Ala 
115 120 125 

Arg Ser Arg Ala Cys Ala Arg Gin His Gin Glu Ala Phe Val Cys Gin 
130 135 140 

lie Val Thr Thr Glu Gly Ser Asp Gly Glu Glu Arg Pro Ser Cys Ser 
145 150 155 160 

Ala Gin Gin Pro Pro Ser Pro Gly Ser Pro Glu Gly Pro Arg Ser Leu 
165 170 175 

Cys Ser Lys Leu Ala Cys Cys Glu Trp Cys Lys Arg Phe Lys Lys Tyr 
180 185 190 

Phe Cys Met Val Pro Asp Cys Glu Gly Ser Thr Trp Gin Gin Gly Arg 
195 200 205 

Leu Cys lie His Val Tyr Arg Leu Asp Asn Tyr Ser Arg Val Val Phe 
210 215 220 

Pro Val Thr Phe Phe Phe Phe Asn Val Leu Tyr Trp Leu Val Cys Leu 
225 230 235 240 

Asn Leu 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 556 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: double 
{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

TCCCCGGCTC CCCACCGCAG CAGCCGTCAC GTCGTCGGAG ATTTCCATCG GGGCGGGCCT 60 

GGGGCGGGGA GCGCGGGACG GGGCGGGGCG AGTGGGAGGA GTGAAAGTTG GAGCCCAGCA 120 

AAAGCCTCCG CCCCGCGCTC AGTGCGGCCA GAGCGTGAGC CGCGACCTCC GCGCAGGTGG 180 

TCGCGCCGGT CTCCGCGGAA ATGTTGTCCA AAGTTCTTCC AGTCCTCCTA GGCATCTTAT 24 0 

TGATCCTCCA GTCGAGGTGA GTCTCCATCC CGGGACCCGG GAGCCCTTCG CGCCCAGCTC 300 

CCTCTCCCCG GGAGCCGGGA CGGCTCCCGG GACCCCAGCG GCCCCGCGTT CCTCGAGCCC 360 

O 

Jfj CGCGCCCGCT TTGCCCCGGC CCTACCGCGG GCTGGCCGAG TCCCGCGTCC CCTCGATGCG 4 20 

fjj CGCCGGCCTC GGCCCGCCTC ACTGTAGGAT GGGCTCCCGG GGTCCTTGAG GGGGAGCTCC 4 80 

O AAAAGGAAGA CAGGACGCCA GAAGGAAGAC GGGACTCCAG TTCGCGGATT CCCGCTCTCA 54 0 

03 

UJ AAAGCACTGC GGTGGC 556 

m 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 
jW (A) LENGTH: 1097 base pairs 

nJ * (B) TYPE: nucleic acid 

QR (C) STRANDEDNESS: double 

,f| (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GGCACTAGCT GGGGCCCCTA CAGAGTGCAG GGCAGAGCTT CATTTTTCGT TTGAATGTTA 60 

TAGGGTCGAG GGACCTCAGA CTGAATCAAA GAATGAAGCC TCTTCCCGTG ATGTTGTCTA 120 

TGGCCCCCAG CCCCAGCCTC TGGAAAATCA GCTCCTCTCT GAGGAAACAA AGTCAACTGA 180 

GACTGAGACT GGGAGCAGAG TTGGCAAACT GCCAGAAGCC TCTCGCATCC TGAACACTAT 24 0 

CCTGAGTAAT TATGACCACA AACTGCGCCC TGGCATTGGA GGTGAGGAGC AGAACGACGT 300 

TCTTCCCCTC CTAGAGGGTC CAGGGGTTGA GGGCATAGGC ATGGAGAATG CACCTGGGCA 360 

GTAACAGAGG GTGCCATGCT CATGGACAGG AACATCTGCT ATTGACCTGT CAGGTAAGAG 4 20 

ATATTAACTC TATTCTCAGC AGTGTCATTG ACCTTGATCA AGACTTTTCC CTTCTCTCGC 4 80 

CCTCAGTTTT TCCAGTGGTA AAATGAGAGG ACTAAACTAG ATTGTTGATC TTCAAGATGT 54 0 
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GTGTCCAATT CTTAACAGTC CGTGAGCTTG GTTTTGCCAT GAAAGAATAA ATAAAGAAAT 600 
AGGATTAGAT GCTGAAACTG TGTGGTCCAA CACTTACTTG ACTCCCCTTT CATCCCCTCT 660 
GACCACTTCC TCCCCCGTCC CATGCGCCTG TTTGACACTT ACCCTCTGCT GCTTCTGCTT 720 
CCCTTATAGA GAAGCCCACT GTGGTCACTG TTGAGATCTC CGTCAACAGC CTTGGTCCTC 780 
TCTCTATCCT AGACATGGTG AGTACTAAGC TTTTTTAGTA CTATTTCCTA GCCCAGGGGC 84 0 

TGACCTATGG GACCTTCCAC AGACTTCTGC TTTCTGCTCT GTACTTCTGT AACAACTCCA 900 
ATAATTTATT TTCTGGAGGG AGAAAGGGAT TTTTTAACCA CTGGTTTGAG AATGAGACTG 960 
GAAAAGGTAA GTCCCTTGCT ACTTGAAGAG GATCTTCAGA AT CAT G AC C A TATCTTCCAG 1020 
TTTTTTCATT CAAAATAGAA ATAATAAAGC AATGTTAAAC ACCACAATGG ACTGCCTCTT 1080 
TTCTTCTGAT AATTATT 1097 
(2) INFORMATION FOR SEQ ID NO : 5 : 

§*\ 

tg (i) SEQUENCE CHARACTERISTICS: 

Q (A) LENGTH: 9631 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



IV? 



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

TGCATGATGT GAGGACCTAG CTCCCCTTGC TCACGGTAAT CAAGCAGGGC ATGACTGACT 60 

TCTCATTTGG GATCATTAGG CTTGGTAACC TCTGAGTTCC CTTCTAGTTC TGACATTGGA 120 

AGAGTCTTTG AGCAGAGAGA AGGGACCTCA TCTGACTCCT GCTTCCTGGT CTAGTGCTCA 18 0 

ATGCATTCCA CCACATTGCC TTTAGGATTC TAGGTTAGGG AGTGGCAAAC AATATGTTTG 24 0 

CATGTGAATG TCCTTTTTCT GTTCATCCCC AAGTATGTGC TTTTCTGTCC TTCCCACCAG 300 

GAATACACCA TTGACATCAT CTTCTCCCAG ACCTGGTACG ACGAACGCCT CTGTTACAAC 3 60 

GACACCTTTG AGTCTCTTGT TCTGAATGGC AATGTGGTGA GCCAGCTATG GATCCCGGAC 4 20 

ACCTTTTTTA GGAATTCTAA GAGGACCCAC GAGCATGAGA TCACCATGCC CAACCAGATG 4 80 

GTCCGCATCT ACAAGGATGG CAAGGTGTTG TACACAATTA GGTATGTCAA GCCTCTGGAG 54 0 

TCTCACTTCC TGGAATTCTC TCTCCCCTTC TGATAATTTT AGCTAAAGAT CCATGGGCAG 600 

AGATCTCATC CTGAATGATA CCTCTAAGGG CCTGTCCAGC TTTCCTAGAC CATGAGCTCA 660 

GCCCCCTTAT GTAACAGATA TAGAGGCCTC AAAATAGAAA GATATTGCTT AAAGCCACAC 720 

ACCAAGTTTG TGGCAGAGCT GGAACTGGTA CTCAGTTACT TGGCTCCGAG TCCAGAGCTC 780 

CCTCAACTAG GATGTGCCAG TATGACTGCA T TAT C TAG AC AATTCCATCC TACGTGGGCA 840 




CTCGATACAA AGATACGTCC ACAGTGGTGG 
GGCAAAGGTA CCTAAGATCA AGTTGGATAC 
GGGGATAGCA GGGAGGATGT TGGCAGTTCC 
CCGATCGAAG TTGCTCCATA CGTTTCTCTA 
TAAGCTGTTT GTTTGTTTTG TTTTGTTTTG 
TGTTATCTCT CCTTTGAGCT TTTTGTCTTA 
ACGCCTGTGA TCCCAGCACT TTGTGAGGCT 
TCGAGACCAG CCTGGCCATC ATGGGAAAAC 
AGGGTGTGCT GGCACTAATT CCAGCTACTC 
CTGGGAGGCA AGAGGCTGCA GTGAGCTGAC 
AG AG T GAG AC TCTGTCTCAA ACAAACAAAG 

Pi 

yTj GTAGTCCCTA GACCAAAGCT GGGTGGATAC 

H CCTTTCTCCC TTCCAGGATG ACCATTGATG 

H CAATGGATTC TCACTCTTGC CCTCTATCTT 

m 

yi TGGGGCCCCA GAAACATGCT GGGCTCCTTC 

HI 

5 TGCCTCTGCT TTTCTTCTAA AATGCTGCCA 

J?! CATTTCCTTT CCTGCCAACA ATACTGTGTT 

fU ATCTCAAGTA TGTCTGCTCC TCTCCATCTC 

ifj TATCTTCCAC CTGGACTTTT GCCACATCTT 

W ATTCATTGTC TCCTCCAAAC CTCTACCACC 

CTGTCTCATT GAAGAGGACC ATCACTCACC 
CCTCTCCCTC ACCTCCTGCA TCTAATCAAT 
CTCTATCGAT GCTGTCTATT TCTCTGCACC 
TTTAATTCTG CCTGGATTAC TACACTGGCC 
ACCAGAGTGA CCTAGCTGAA GGGTCACCTA 
CGTTAACTCT CATGGATCAA TTTGAAATTC 
CTGGACCCTG CCACCCAATC CTGTGCACCT 
TTGTCCTTTC CACAATACAC CAGGTGTTTC 
ACCTCATTCT TATTGAGAAT ATTTACTTGA 
TTTATGAAGT CTTTTCTGAG TATGTCCCCA 
GTTCTGTGGA CTTAGGTTTT TCAGAGCTCC 




-110- 

AATTGTTCAG GCAGAGCAGC AGCACGTAGT 900 

TTGAATTCCC AGCAGGGGAA GGTTGTGTGT 960 

TGGAAACTAG GGTGGGCGAG AAAACAAAAG 1020 

ATGATGGAGC CCAGAGTAAC CAGATACTTC 1080 

TTTTGTTTTG TTTTGTTTTG TTTTCTCTCT 114 0 

AATTCTAGCG AGGTCCAGGC ACGGTGGCTC 1200 

GAGGCAGGCA GATCACTTGA GGTCAGGAGT 1260 

CCTGTCTCCA CTAAAAATGC AAAAATTAGC 1320 

GGGAGGCTAG G G CAT G AG AA TTGCTTGAGC 1380 

ATCACGCCAC TGCCCTCCAG CCTGGGTGAC 14 4 0 

AAAAAAATTG ACTCTGGCCA TTCATTGGTG 1500 

GGAAGTGCTT AGGGCCAGCC TGATGAGGCT 1560 

CCGGATGCTC AC T C C AC AT G CTCAGATTTC 1620 

TCTCTAGCTG TGAGTACCTT CTTAAGTTTC 1680 

TTTTTCTCAT CCTTGCCATT TACATTTTTC 174 0 

AGGTTGTGCA GGACTTCCAT CCTCCACCCT 1800 

GCTCATCCCT TCCACGTGCC TCTGAAGCGT 18 60 

CACTGGCACT ACCTTGGTTT AGGCCTTTGT 1920 

CACTTTGAAA CTGCACATGT CCAAAATGAA 1980 

AAAACAAGTG TGTTGCTTCT GGGTTCCCAT 204 0 

CAGTTGCGCA AATCAAGAAC TTTGATGTTT 2100 

CAGCACATCC TGTTGGTGTT TCCTCCCAGT 2160 

CTGTACAGCT TTGACTTCCA CCTGCATTAA 2220 

TCCTTGACAA CATGTTGTCC TCACAGAAGG 2280 

GGTTGGGTCA CTTCTTAGTC TCGAATCTGC 234 0 

CTTAGAATGA ACCTCAAGGC CATTCATGAA 2 4 00 

CATCCTCTGT GAGCTAGCCA TCCTGAACTT 24 60 

ACCTTTCTAT ACTGCCCCTT AACCCCTTCA 2520 

GTTTCAAGAT TTAATGGGAA TATCACCTGC 2580 

AGTGACCTTT ATCTACTTTG TTTCCCCGCT 2 64 0 

TCCAAAAATC ACAGTAGTAT ACTCACTGTC 27 00 
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GTCTCTCCAT CTGGAGCATA CTTACGTGCT CGGCGATTAA GGGTTCAGAA TGCAGTGGTA 4 620 

GCCTGCTACT CTGGCCATCT TGGACCTTGA TCCAGAGAAT CTCTGCTTCA GGAGCTTCTA 4 680 

AGAGAGTCCA GCCCTGCCTC CAGAGAGAGG CTTGCCCTTC ACTGATGGCT GTGGAGCCTC 4 74 0 

TGATGGAATA TTATTGCTGG TCAGGAATTC ACTGTCTTAC AAGGAGGTTT CCTTCTTCTC 4 800 

TAGACAGTTC TGTTCATCAA AAAACTCTCC CTGTTCTTCT GAAATTGGAG TCTCTGGAAG 4 8 60 

TTCCACACAT TAAGCTTAGT TCTTTTTCCT TGGAACTGTC CAGGTTACAT TAGTCCAGCC 4 920 

ACTGTTTCAC AGGACCGAGA TTAAACGATC AACATCATCA TTCCCGGCAT GGATCATAGT 4 98 0 

CTGTTGTAGT CTACATAGCC CTAGTTTATT TTTCTTCCCT TATTCTTCAA AGCTTTGGGT 504 0 

CCATTCATTC TTCTAGTCCC AGTCCTCTGG ACATGGTCTA TTTAATTGTG TCCCTCTGAC 5100 

ACTGCAGTGA CCAACCATGA TCTGGTCAAA GAGGATAAGA GTTTGAGCAG AAAACCATCT 5160 

TTAGCATATA TTTTTTTGCT TTGGTTCATC AGCCCCAGAT ATATTGTTTT CCTTACCCGT 5220 

£3 

GCTTCTCTCA CTCCTCAAGA AGAAGAAAGT GTGTGTTAGC ATCTTTCTCT TGTCCTTCAA 52 8 0 



a 

hi 



GACAAATTGG CATCTCTTGA CGAGCGGAGA AGGTTCTTTT TTGGCCAGAA TAAATAAAAT 534 0 

U TAAAATAGAA TCATCCAACA GAATAATAAA TCTTCGTGCA ACAAGAATAT AT TAT AT AAA 54 00 

OB 

LJ CCCAGCAATT TTGCAGGGCC TGGGTATAAC TAATTAGAAG TGTCTTAAAT TGCAGTCAAG 5 4 60 

h\ 

ATCCCACGGC AAGAGGACTT TTGATAAATA CATTCTGGCC AGTAGGCAAG TGCGAGGGTG 5520 

GTCCGTGCAG CAGCTCTGGA GGAGTTCTAT CCCAAAGCTA TACTCAACAC ACAGGTTTCC 5580 

ru 

fU CACTGACAAC AGGTCGCTCC CTTGCCTTCT TCCAGAAGAA TCTGAGAAGC TTTGCTCCTT 5 64 0 

.>! GAGTTTCAGT GCTGCCAAGG TGAGTACGAA AGGCTGCTCT TCTCATTCAG CTCCAGCCCA 5700 

^ CCCAGACCTG CTGGGCAGTT GATCCACTTT CCAAAATAGG AGGACACACG GACAGGTTAG 57 60 

TGTTCTGGTC TGCTTTACAA AGCTGTTGCC TGACAGGAGC AAGAGTTGCT GAGTGTCTGC 5820 

TGGGTTCCAG GCTGTTCTGA GCTTGGATGG GCAGGGGCTA AG C C AC AG GG CCTGCATGAG 588 0 

CCCTGCCTTG AAGGGACTTA AAAGACGACC TAATTATAGG CCTAGGAATT TTACAGTATT 5 94 0 

GCAACTGCAA TGTGATGCTG AAAGTGGAAA ATGATGTCCT GGGCTCAGAG AAAAGCCCAC 6000 

ACCAGCCTGG GAGTCATGAT AGCAGCAGAG TGCTTGGGGA GGGTGTGTCA GAG C AT AAAG 6060 

CAGCATGAAT GCTACAAAAG AAGATGCCAA CTAGAGATAT AGGTTGTCAT CAGGTCCCGG 6120 

AGGAGCCATG ACCGTCTAGC TGAGAGCCAT GACCAAGGAC ACAATGTCCA AGTGACTGTG 6180 

AGGACCTCAG TCTGCCCTGT GGATGTGTAT GCCACAGACC TGACTTCTGG AGGGCTGACT 624 0 

GAAATGTTCA TTTTAAGCTT TTTCTTCTCT TTCCCTGAAA CACTCAGTTT GGGTTAGGGG 6300 

TCATAGACTA AGACCAAAGA GTCCAGGGTT AGAATCTTGG TGTAAAATTG CAGGCCATCT 6360 

CAGGAAATCT GTGAGCAGAT GGGATTGGCT TTGGGTAAGG TGCGTGTGGA AAATGTCAGT 64 20 




GGGAGCCGGG TCATGGTGGG CCTTTAGCAT 

CTCATGTGAA ACAGGGAGCC ACCAAAACTT 

CCTGTGGAGC AGTGCTCACT CACGAAGGCA 

GGCCAAAAGC ATTCTGTGCT TCTCCACCAC 

GAGTAGACAT TCTGAGGGCC AGCAGTGCAG 

TTAGTCAGCC ATCTGGATAT CAGCTGCGTG 

TTCACTTTCA CCTGACTCTC TTGCCAACCT 

AAC TGAGAAG GAAGCTATTA AGATTCTGGC 

TTGCAGTAAG TCTTTTTTCT GGGCTTCTTC 

ATCATGCCTC TATCCTAGGG AATACTGTGA 

CCTGCTAAGG GGGACCGTCG TGTCAGCACT 

S 

GGTGACTTCA TGGTCATGAC GATTTTCTTC 

H TTTCAAAACT ATGTCCCTTC TTCCGTGACC 
W 

O AAGACAGAGT CTGCTCCAGC CCGGACCTCT 

hj AGGCACATGG CTGGGAGTTG GCTGGGCCAG 

ry 

ACCAATGTCG CACATAGGTC AGGAGCTGAG 

^ CCTCTGTTCT GACCATGACC ACGTTGGGCA 

ill 

ffj CCTATATCAC AGCCTTGGAT TTCTATATCG 

m 

._n TGTTGGAGTT TGCTGTGCTC AACTTCCTGA 

- ? CTAAACTCCG CCATGTATGA GCTGGGTATG 

ACATGCTAGC AAGGGTACTG GGGTTATGGC 

ATGTTACCCA GGGTGGTGGC GGGGTTGAAT 

AGGAGAGAAG CCAAGATATG GTGTGCCAAA 

TTAGACTTGG GTTATCTTCC CTTGACTTTT 

CGTATCAATA GCCGTGCCCA TGCCCGTACC 

CATCAGGAAG CTTTTGTGTG CCAGATTGTC 

CCGTCTTGCT CAGCCCAGCA GCCCCCTAGC 

TGCTCCAAGC TGGCCTGCTG TGAGTGGTGC 

CCCGATTGTG AGGGCAGTAC CTGGCAGCAG 

GATAACTACT CGAGAGTTGT TTTCCCAGTG 

CTTGTTTGCC TTAACTTGTA GGTACCAGCT 
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CAGATTCCAG AGTGCAGATA GTCTGTATAG 64 80 

TGGGGAGCAG GCTAGTGCCG GTTTTGACCA 654 0 

TTTTGCCATC ACATGAATGT GCAGAAAGGA 6600 

AGC AC AG AC T TCCCTAGTCT CATTTGCTGA 6660 

GTGTGATGTG CCTCAGAGGG TATGAAGCCC 6720 

GGCATGATAT CTAGAAGGCT AATTGATTTT 6780 

GCAGAGACAG ACATTGGGTG TAGGACAGTG 684 0 

CTTGGCTTAG CTCTCAACTG GCCATTGGTC 6900 

TGGTCCTATT TGTATGTATT GCATTGTCAC 6960 

GCTGAAAAAT GAGACCCTTA CTGTTCACGT 7020 

GTAATGCAGT GATGTTTTTT GTGTCTTTCA 7080 

AATGTGAGCA GGCGGTTTGG CTATGTTGCC 714 0 

ACGATGCTCT CCTGGGTTTC CTTTTGGATC 7200 

CTAGGTAAGA G G AG AAAC AG GTATACGCAT 72 60 

GGCAGAGTTG CCTTGTCATG GAGTCTTTTA 7 320 

CCCATCACTC TTGTGCTCTT GCAGGGATCA 7380 

CCTTTTCTCG TAAGAATTTC CCGCGTGTCT 74 40 

CCATCTGCTT CGTCTTCTGC TTCTGCGCTC 7500 

TCTACAACCA GACAAAAGCC CATGCTTCTC 75 60 

GGAGTGGTGG CAAGGCTTTG GAGTGTAGAG 7 620 

ACATGGGTGG TCAGCTTGCT GAG T GAT GG A 7 68 0 

CAACTTCCTG ATGTAATGGT GAGAAGTTGG 774 0 

GACAGTTTCC AGAAAATCCG GAGGCAGCAC 7 800 

CCCCACTTCT TTCCTTGTCC ATTTTAGCCT 7 8 60 

CGTGCACGTT CCCGAGCCTG TGCCCGCCAA 7 920 

ACCACTGAGG GAAGTGATGG AGAGGAGCGC 7 980 

CCAGGTAGCC CTGAGGGTCC CCGCAGCCTC 804 0 

AAGCGTTTTA AGAAGTACTT CTGCATGGTC 8100 

GGCCGCCTCT GCATCCATGT CTACCGCCTG 8160 

ACTTTCTTCT TCTTCAATGT GCTCTACTGG 8220 

GGTACCCTGT GGGGCAACCT CTCCAGTTCC 8280 



• # 
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CCAGGAGGTC CAAGCCCCTT GCCAAGGGAG TTGGGGGAAA GCAGCAGCAG CAGCAGGAGC 834 0 

GACTAGAGTT TTTCCTGCCC CATTCCCCAA ACAGAAGCTT GCAGAGGGTT TGTCTTTGCT 8 4 00 

GCCCCTCTCC CCTACCTGGC CCATTCACTG AGTCTTCTCA GC AGAC CAT T TCAAATTATT 8 4 60 

AATAAATGGG CCACCTCCCT CTTCTTCAAG GAGCATCCGT GATGCTCAGT GTTCAAAACC 8520 

ACAGCCACTT AGTGATCAGC TCCCTAAAAC CATGCCTAAG TACAGGCGGA TTAGCTATCT 8 580 

TCCAACAATG CTGACCACCA GACAATTACT GCATTTTTCC AGAAGCCCAC TATTGCCTTT 8 64 0 

GCAGTGCTTT CGGCCCAGTT CTGGCCTCAG CCTCAAAGTG CACCGACTAG TTGCTTGCCT 8 700 

ATACCTGGCA CCTCATTAAG ATGCTGGGCA GCAGTATAAC AGGAGGAAGA GATCCCTCTC 87 60 

CTTTGGTCAG ATTATTATGT TCTCAGTTCT CTCTCCCTGC TACCCCTTTC TCTGCAGTTA 8 820 

GATAGACACT GGCATTATCC CTTTAGGAAG AGGGGGGGGC AGCAAGAGAG CCTATTTGGG 8 8 80 

ACAGCATTCC TCTCTCTCTG CTGCTGTGAC ATCTCCCTCT CCTTGCTGGC TCCATCTTTC 8 94 0 

o 

•^j GTCTGCACTA CCAATTCAAT GCCCTTCATC CAATGGGTAT CTATTTTTGT GTGTGATTAT 9000 

r| AGTAACTACT CCCTGCTTTA TATGCCACCC TCTTCCTTCT CTTTGACCCC TGTGACTCTT 90 60 



S3 TCTGTAACTT TCCCAGTGAC TTCCCCTAGC CCTGACCCAG GCACTAGGCC TTGGTGACTT 9120 

S3 

y CCTGGGGCCA AGAAACTAAG GAAACTCGGC TTTGCAACAG GCATTGCTCG CCATTGATTG 9180 

hi 

GTGCCCACCC AGGGCACACT GTCGGAGTTC TATCACTTGC TTGACCCCTG GACCCATAAA 92 4 0 

y CCAGTCCACT GTTATACCCG GGGCACTCTA AC CAT C AC AA TCAATCAATC AAATTCCCTT 9300 

Pj AAATTTGTAT GGCACTGGAA CTTTGGCAAA GCACTTTTGA CAAGTTGTGT CTGATTGGAG 9360 

01 

C T T CAT GAT A GCCTTGTGAC ATCTTTAGGG CAGGATTCTT ATCCCCATTT TGCAGATGAA 94 20 

^ AACCCTGAGT C AC AG AT T T C TGTGGGACTG TGGATCTCAC TGGAAGCTAT CCAAGAGCCC 94 80 

ACTGTCACCT TCTAGACCAC AT G AT AGGGC TAGACAGCTC AGTTCACCAT GATTCTCTTC 954 0 

TGTCACCTCT GCTGGCACAC CAGTGGCAAG GCCCAGAATG GCGACCTCTC TTTAGCTCAA 9600 

TTTCTGGGCC TGAGGTGCTC AGACTGCCCC C 9631 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7479 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
• (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TTCTACCAGA CCTGGTATGA TGAGCGCCTC TGTTACAATG ACACCTTTGA GACACTTATT 60 



• # 



TGGAGGGTAG GTTCACGCCT TACTCATCTT 

TTGAATGCAC TTCAACAACT GGGCCAAAGG 

AAAAGTCCAA AACATGGCTC AAGCTAGCAC 

CAGTCTCATT CATTCCTCTT TTCCACATTG 

AT AT AC AAG T GGGAGAATTT C AAAC T C AAA 

GAGTTTGATT TTACAGGAGT GAACAACAAA 

TGTGCAGTGG GAGCTAGAGC TGGGCAGAGT 

GCTATTTGTA CCAATGCTAT GTAAAGAGCT 

C T C T CAT AT A TGCTAAACAC TTACAACTGT 

GTCTCAGTCT CATCTGCAAA GAAGTAAGTG 

GTAGATATAT GTTTGTAACC TTCAGTATGT 

a 

TATTTAATTT CTTTCTTATA TCACTGACTA 

7f\ TGTGATTTTA CCAAATCCTG AACCAGGTCA 

^ AGCAGGAGGT AG AAG T G G AA AC TAG AG T G A 

CO 

U GCAGATGGTT ACTCAGGTGC AATGGCTACA 

ru 

. GAG T AC AT AT TTTCACCAGA GGCTTCAGAG 

2 s ; TGATCTTGTT CCTGACAAGG ATAATAGGGG 

nJ 

RJ CAAATAGATG TGAGCATCTT TGCTTGTATT 

yg TCTTCTCAGG GTACCTCTGA GAGCAACCCT 

^ TGGATGGATC AGGTCACCCA GCCAGACTTT 

T GAT G AC AT A AAAAAATCTG CTATTGCTAC 
CTAACCCAAC TAAAGCTTTG CTTATACAAT 
TTCCTGGGAC TCTGAGAAAG CCTTCTTGCT 
TACATAAATG TGGGCCTGCC TCTTAGTATT 
TCAGAAGTGA CAGAAGAGAC AAGGTTCAGG 
TAGATGGTTT TTGCATTGTT TCAAAATGGT 
CAT CATC AG T GAAGCAATGT CAGTAGGTAA 
GGTCTTCTTT TTTCATTGAC TAG AG AC T AC 
AGTTATTTTT GAGTCAGGTT CCAACTCTCC 
TAGAACACAC ATGTACTCAT TTTTCAGGGT 
CATCTTGGAC CTTACAGCAG AGTTTCTGCT 
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TGCATCCGAA GCCCAGGGCA AATGTACTTA 1980 

CAGGGTCTCA AAGCTGGTGA AGCCAAAGTC 204 0 

CATGCTTGTG GGGCCCTCAA CTGGGTTTCC 2100 

TTTTCAGTTT CCTATGATGA GCATGAGATG 2160 

ATCGATGCGA AG AAC AC T T G GAAGCTATTG 2220 

ACTGAAATCA TCTCCACCCC AGTTGGTAAG 2280 

CTGAGGGTGT AG T AC C AAC A ACTGGAAGTG 2 34 0 

TATGGAAACA CTGTCTAGGA CATCGCTTCT 2 4 00 

TTGCAGAGGA AACTGAGACT TCATAGCTAT 24 60 

CTTTGCCAAG CCCCTTGAAG GACGGGGTAA 2520 

TAATCTACCT TTTGAGTTTT AGATCTTTTG 2580 

AC T T AG AC AA CATTGGGGAG GGGTACTCTG 2 64 0 

AGTTGGTTGA GTGGGAAGGA AGACCACCCT 2700 

CAATGGCGGC TAGTAGGGAT TGAGGAGTCT 27 60 

GTAATATTAT CTTAAAGGAG CAACTGTGAA 2820 

TGGAATCAAC AT AAAC AT G T TTTTTCCTCC 28 80 

CAGATTCTGG GTCTCTTAAC TTGTATCTCC 2 94 0 

CCTAGGTGAT CTGAATGAGG AGC C AAT AAA 3000 

GGACCTTCTT C AC AT G T AC T GACAAGAATG 3060 

TGACATTTAT TTTGCTTTGC TTCTGTGTGA 3120 

TGTTGGTCAC TCGGTGAGGG GCAGCTTATG 3180 
ACTAGGCAGG GATGGGTGCT GCTTGTCCTG - 32 4 0 

TGCTTTGACT TGGGGTTCTT GTTCTTCTTC 3300 

CAGTTTGTCC AGATAGGTGG AGCCTCATCA 33 60 

GGCTTTCAAG TCACTAGTGG TTCCGTTTAG 34 20 

GCCCTAGTGA CTACAAAGCC CCAGAGCCAG 34 8 0 

GCAACATCTC TTACTGTGGA GAAAGAAATA 354 0 

TTATTCTGAA TTCATACTGT CTTCACTGTC 3600 

CAGTTTGACC CTGGGGCACT TCTTTCTAGC 3660 

TTAAATGCAG CAGTGCCCTG CAACTTGGGC 37 20 

TCAGGAGCTT CTCAGAAAGT GCAGCCTTGT 37 80 
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CTGTAGTAAG TAAACCATCC TTCACTGGTG GCTGTTGAGC CTCTGAGGAA ATCTTGCCAC 38 4 0 

GATCAAGAGG TCAGCATCTT ACAAGGAGAT TATTTTCTTC CCTAGACGTC TCTTTCCATC 3900 

AGAAAATTCT TCTTGCTCAT CTAAAATGGG AGTTTCTGGA ACATTTACCC ATCAGATCTA 3 960 

GTTCTTTTTC CCCTATAACT ACCCAGTTTA CATTAATTTA AGGTACTCTT TCAAAAGTCA 4 020 

AATGTAAAGA TCTAGGGTTA AC AT T AGAGC AC AC AT T T G T ATTCTATTGG GACTCTCCTC 4 080 

CCTAGTTCCT TTTCTTCTCT TGTTCTTCAG TTCACTAGTC CTCTGGGCAG TCTTTTAATT 4140 

GTGTCTTTTG ACAGTGCAGC CATCAACTAT GATCTGGACA AAGCAGGTAA GAACCAAAGT 4 200 

CATCTTTAAT CCATTTTGTT TTGGTTTGGC TTTCAGTCCT AGATACCTCT CTTCCATTGG 4 2 60 

TATTGCTTCT T C AAT CAT C A GAATGAAAGA AACACTTATT AGTATCCTTC TCTTGTCCAA 4 320 

TTTCCAGCTT TTAGTAAATA AAACATTTCT TCTTTAATCA GACTGACTAA AATTTAAAGA 4 380 

ATGAGAAACC CAACAGCACA ATGCATCTTG T G AC AT AC AT ATTAAGGGAG TTTTTAATCA 4 440 

O 

J CAGTCAAGTC TCCAGGACAG AAAGAATAAT AATTTGGTCA TTCTGGCTAG TAGCATAGAA 4 500 



I 1 1 

Hi 



SB 



TGGTTTCTGG TGCAGTGCCG GAGGCATTCT AGTATTCAGG CAGGAAGTTT CCCACTGACC 4 560 

ATAAGCTACT TATTTGCCTT TCTCTGAAGA ACCTGAGAAG CTTTGTTCCT CCAAGGTCAA 4 620 

GTCTCCTAAC ACTAGGAGGA AAGGTTGCTC TCGATCTTCA CGGCCCACTC AGACTTGCCA 4 68 0 

GGTACATGGT CTGTTTTCTG TATAGTCTAC AGTCTAGACA TTTAGTGTTT ATGTCTGCCT 474 0 

TCCAAGGTTA TTTTGACCAG GGGTATCAAA AG T C AC T GAG CACCATTGTT TCCCATGTTA 4 800 

TGCCAGGCTT GGTTGGCTAG GGGCTATGTT AAAGGCTTTA TATGAGCCCT TCCAGGCAGG 4 8 60 

AGCTTGAAGG TCCATTTAAT TTAAGGTCAC AGGTGGCTGT T GC AT C AC AG AACTGGAAAT 4 920 

TGAGGGCTTA ACTTCACAGA AAAATCCATG G T T G AG AC AA TTTAATGTCT AAT G AC AT G G 4 98 0 

AG AG T AAAAG TGATCCTCTG GGCAAACTAA GCATGGAGTC ATAATATTAT G AAG AG CAT T 504 0 

AGAAGCAACA T TAG G AAC AG AAT AT G C C AG CTGGAAATAC AGGCTTGATC AACTTCTGAA 5100 

AGATCCATTT CTATCTAAAG T AAAT AC AT G AGAAGGTCAT TATACCCAAG TGAATGTGAA 5160 

GGCCTACACT GCCTTCCGTC TGAATCTTTT GCAAACCTAA CATCTAAAGC TTGACTGACA 5220 

AG TT CAT ATT CATCTACTTC TATTTTCCTA GGACATTGAA TTTAGGTTAA GCAGTAATAG 5280 

AT C AAG AAG A GGGAGTCCAG GATTTGCTTA CCAGTATAAA TATATGTGGT ATCTAAACTG 534 0 

GGAAGCAGTT CAATTAGTTT TAGGTGAGGC ATTCATGTTG TATGTGTGCA TATGTATGAG 54 00 

TCAAATCCAG TTAGGTCTTA G AAT C AG AAT AC AT AG T G T A G AC AC TAT T A TGTAGCTCAT 54 60 

ATATAGCAGA ATATGTCATA TTTGGCTAAC CCAAGTGGGC ATGCTATGAG GACTAACAGG 5520 

GCAGATAATG TTGTGACATC ACAAACTATA CATTGCATTT CTGTGGGTCT CAGCTAGCCA 5580 

TCAGCTATTA GGTTTTAATA CCATGAAAGT AAAGGTTATG GGGTTGTTTC TTGTTTGTTT 564 0 




GATTTGTTTT GAACTTTTCT CTCACTACTG 

TGAATCAGTT AACTGAAAGT AGGTTTATTA 

GCAATGGGTC T T AC AG T AAA TATCTGAGTT 

T TAT TAT AG A ACTTTGAAAG GGTTTCCCAT 

GCCATGCTGG ACATCAGGCC AGTACTGCAA 

ACTTCATGGT CATGACATTC TTCTTCAATG 

AAAAC TAT AT CCCTTCATCT GTTACCACAA 

TAGAAGCTGC TGCTGCCAGG GCCTCTGTAG 

CATATGTCAT AGATTAACAT TGCCTTCTCC 

CCCACCTTCA CTCCTTGCTC TTATAGGGGT 

TACCTTTTCT CGTAAGAATT TCCCTCGTGT 

0 TGCAATTTGT TTCGTCTTGT GCTTCTGTAC 

GACCTACAAT AATATTGAAC GACAGGCTTC 

P TTGGTATGGG AAGCAAAACT T TAG AC T G AA 

OP 

yj CCTTTCCAAG TGATGAGAAT TTCCCTGGGG 

ru 

CAGGAAGGCC TAGAACTGAT GTGGAGAGTG 
U TTCTTGACTC ACTTTCCCTT GACCATTTTG 

ru 

fU CCAATAGCCG TGCTAATGCA CGTACTCGTG 

m 

CCCGTGCTCG TCAGCAGCAG GAAGTGTTTG 
CTGAAGAGGG TTACCAGTGG TCTCCAAGAT 
GTGGCCGAAG CTATGTGTGC TTCAGGGTTC 
GTGAGGGCAA CAGCTGGCAG CGGGGCCGCA 
ACTCGCGGGT GCTTTTCCCC ATTACATTCT 
GCCTTAACCT GTAGGCTCCA GCTGGTAGCT 
GTCCTAAGCC CCTTTGTCAA GGGAGTTGGG 
GAGTTTGTCT TTGCTGCTCC TCCTTTTGTT 
TTCTCAAGTA ACAGCCCACC TCTCTATTCT 
CACAATTGAG C AG AT AAGC T ATCTCTTGGT 
GGTCATGGTA CCATTTGTGG TGCTAAGGGG 
GAGCAACAAC AGTATAGCAG AAGGAAGGTC 
TGATTTTTTC TCCCTGCTCT CTCTTCCTCT 
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CTTCTCTGGG AAAAACAGAT TTTGGGAGAA 5700 

ATACTCTTGT ACCAGCTTAT CTCTCAATTA 57 60 

TCTACTTTGC TGCACTGAAA TACTGTGCCT 5820 

GAGCTGGATG GTGGGAGCCA CACTATATGT 5880 

TGCAGTGGTG TATTTGATGT CTTTCAGGTG 594 0 

TAAGCAGGAG GTTTGGCTTC ATTGTCTTTC 6000 

TGCTTTCCTG GGTGTCCTTC TGGATCAAGA 6060 

GTAAGAAAGT ATGGTGATCT TAAATGTGAT 6120 

TGTTGTCCTG CAGCACTGTC AGGAGCTTAG 618 0 

CAGTTCTGTG CTCACCATGG CCACACTGGG 624 0 

CTCCTATCTC ACAGCTTTGG ACTTCTATAT 6300 

TCTACTAGAG TTCACTGTGC TCAACTTCCT 6360 

TCCAAAGTTC TACCAAGTAA GAACCACGTG 6420 

GATAGGAGGG TTATGGCACT TGTGTGACTA 64 80 

T T AT AG AGGA AATAAACTTG CTGTGATATC 654 0 

TAGTTTCTAG GAAATTCTAG GCAACTTTTC 6600 

TTATCTTTAT TTTTCTATTT CAGTTTCCAA 6660 

CTCGTGCCCG CACTCGTGCT CGTGCTCGTG 6720 

TGTGTGAGAT TGTTACCTAT GAGGAAAATG 678 0 

CAAGAAGACC TCAGTGTCCC TGGAGGCGAT 684 0 

TCAGGAAGTA TTTCTGCATG GTTCCTGGTT 6900 

TCTGCATCCA CGTTTATCGC CTGGATAACT 6960 

TCTTCTTTAA TGTGGTCTAC TGGGTGATTT 7 020 

CATGGGGCAA TCACCTCAGT TCCCCAGGAG 7 08 0 

AGACAATAGC AACAGCAGCA AGCCTGAGGA 714 0 

GGCCCCTTCA CTCAATCTTT TAAACAGCAC 7200 

TCAAAGAGTA TCCATGATGT TCAGTGTGCC 7 2 60 

TGTGTTTGCA GTTACTGTTT TTCCCCAGGA 7 320 

CCCAGCTCAA GCTCAGCCCT AGAATGCACA 7380 

CTCATCCTTA AGTCTTTAAG TCAGGCTATT 744 0 

ACACACACA 7 4 7 9 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pairs 
<B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CGCGACCTCC 10 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 
0 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
% (D) TOPOLOGY: linear 



rv 



80 



(ii) MOLECULE TYPE: cDNA 



CO 

ru 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
O TCCAGTCGAG 10 

(2) INFORMATION FOR SEQ ID NO : 9 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GTGAGTCTCC 10 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 
AATGTTATAG 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



= 3{ 



HI 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
GGTCGAGGGA 

(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pairs 
M (B) TYPE: nucleic acid 

03 (C) STRANDEDNESS: double 

yj (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
GGCATTGGAG 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
GTGAGGAGCA 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 



m 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
TCCCTTATAG 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
AGAAGCCCAC 



00 

yj (2) INFORMATION FOR SEQ ID NO: 16: 

fu 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pairs 
y (B) TYPE: nucleic acid 

FU (C) STRANDEDNESS: double 

fy (D) TOPOLOGY: linear 

CP 

^ (ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
CCTAGACATG 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
GT GAG T ACTA 
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(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
TTCCCACCAG 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GAATACACCA 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
ACACAATTAG 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 
GTATGTCAAG 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
TCCCTTCCAG 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 
GATGACCATT 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 
TTCTCTAGCT 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 



UJ 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
GTGAGTACCT 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
TCCTTTCCAG 



CO 

y (2) INFORMATION FOR SEQ ID NO: 27: 

hi 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pairs 
^ (B) TYPE: nucleic acid 

TU (C) STRANDEDNESS: double 

fU (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 
TTTCCTATCC 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 
ACCCCAGTTG 
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(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 
GTAAGCGTGC 

(2) INFORMATION FOR SEQ ID NO: 30: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

J*! (ii) MOLECULE TYPE: cDNA 

w 
u 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 
M GTCTTTTCAG 

5=5 i 
5fc> 

fU (2) INFORMATION FOR SEQ ID NO: 31: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(i) 



a 



(i) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 
GTGACTTCAT 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 
ACCTCTCTAG 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 
GTAAGAGGAG 

(2) INFORMATION FOR SEQ ID NO: 34: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

rrrj 

ru 

§ys (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 

^ GCTCTTGCAG 

OB 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(i) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 
GGATCACCTC 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 
ACTCCGCCAT 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 
GTATGAGCTG 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 
TCCATTTTAG 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39 
CCTCGTATCA 
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(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
ATATATCAGC 

(2) INFORMATION FOR SEQ ID NO : 4 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3154 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 41.. 1558 

(ix) FEATURE: 

(A) NAME /KEY : sig_peptide 

(B) LOCATION: 41 . . 94 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 95.. 1558 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

CGCGACCTCC GCGCAGGTGG TCGCGCCGGT CTCCGCGGAA ATG TTG TCC AAA GTT 

Met Leu Ser Lys Val 
-18 -15 

CTT CCA GTC CTC CTA GGC ATC TTA TTG ATC CTC CAG TCG AGG GTC GAG 

Leu Pro Val Leu Leu Gly lie Leu Leu lie Leu Gin Ser Arg Val Glu 
-10 -5 1 

GGA CCT CAG ACT GAA TCA AAG AAT GAA GCC TCT TCC CGT GAT GTT GTC 

Gly Pro Gin Thr Glu Ser Lys Asn Glu Ala Ser Ser Arg Asp Val Val 
5 10 15 

TAT GGC CCC CAG CCC CAG CCT CTG GAA AAT CAG CTC CTC TCT GAG GAA 

Tyr Gly Pro Gin Pro Gin Pro Leu Glu Asn Gin Leu Leu Ser Glu Glu 

20 25 30 35 



ACA AAG TCA ACT GAG ACT GAG ACT GGG AGC AGA GTT GGC AAA CTG CCA 
Thr Lys Ser Thr Glu Thr Glu Thr Gly Ser Arg Val Gly Lys Leu Pro 
40 45 50 
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GAA 
Glu 


GCC 
Ala 


TCT 
Ser 


CGC 
Arg 
55 


ATC 
He 


CTG 
Leu 


AAC 
Asn 


ACT 
Thr 


ATC 
He 
60 


CTG 
Leu 


AGT 
Ser 


AAT 
Asn 


TAT 
Tyr 


GAC 
Asp 
65 


CAC 
His 


AAA 
Lys 


295 




CTG 
Leu 


CGC 
Arg 


CCT 
Pro 
70 


GGC 
Gly 


ATT 
He 


GGA 
Gly 


GAG 
Glu 


AAG 
Lys 
75 


CCC 
Pro 


ACT 
Thr 


GTG 
Val 


GTC 
Val 


ACT 
Thr 
80 


GTT 
Val 


GAG 
Glu 


ATC 
He 


343 




GCC 
Ala 


GTC 
Val 
85 


AAC 
Asn 


AGC 
Ser 


CTT 
Leu 


GGT 
Gly 


CCT 
Pro 
90 


CTC 
Leu 


TCT 
Ser 


ATC 
He 


CTA 
Leu 


GAC 
Asp 
95 


ATG 
Met 


GAA 
Glu 


TAC 
Tyr 


ACC 
Thr 


391 




ATT 
He 
100 


GAC 
Asp 


ATC 
He 


ATC 
He 


TTC 
Phe 


TCC 
Ser 
105 


CAG 
Gin 


ACC 
Thr 


TGG 
Trp 


TAC 
Tyr 


GAC 
Asp 
110 


GAA 
Glu 


CGC 
Arg 


CTC 
Leu 


TGT 
Cys 


TAC 
Tyr 
115 


439 




AAC 
Asn 


GAC 
Asp 


ACC 
Thr 


TTT 
Phe 


GAG 
Glu 
120 


TCT 
Ser 


CTT 
Leu 


GTT 
Val 


CTG 
Leu 


AAT 
Asn 
125 


GGC 
Gly 


AAT 
Asn 


GTG 
Val 


GTG 
Val 


AGC 
Ser 
130 


CAG 
Gin 


487 




CTA 
Leu 


TGG 
Trp 


ATC 
He 


CCG 
Pro 
135 


GAC 
Asp 


ACC 
Thr 


TTT 
Phe 


TTT 
Phe 


AGG 
Arg 
140 


AAT 
Asn 


TCT 
Ser 


AAG 
Lys 


AGG 
Arg 


ACC 
Thr 
145 


CAC 
His 


GAG 
Glu 


535 


w 


CAT 
His 


GAG 
Glu 


ATC 
lie 
150 


ACC 
Thr 


ATG 
Met 


ccc 

Pro 


AAC 
Asn 


CAG 
Gin 
155 


ATG 
Met 


GTC 
Val 


CGC 
Arg 


ATC 
He 


TAC 
Tyr 
160 


AAG 
Lys 


GAT 
Asp 


GGC 
Gly 


583 


w 

ru 


AAG 
Lys 


GTG 
Val 
165 


TTG 
Leu 


TAC 
Tyr 


ACA 
Thr 


ATT 

He 


AGG 
Arg 
170 


ATG 
Met 


ACC 
Thr 


ATT 
He 


GAT 
Asp 


GCC 
Ala 
175 


GGA 
Gly 


TGC 
Cys 


TCA 
Ser 


CTC 
Leu 


631 




CAC 
His 
180 


ATG 
Met 


CTC 
Leu 


AGA 
Arg 


TTT 
Phe 


CCA 
Pro 
185 


ATG 
Met 


GAT 
Asp 


TCT 
Ser 


CAC 
His 


TCT 
Ser 
190 


TGC 
Cys 


CCT 
Pro 


CTA 
Leu 


TCT 
Ser 


TTC 
Phe 
195 


679 




TCT 
Ser 


AGC 
Ser 


TTT 
Phe 


TCC 
Ser 


TAT 
Tyr 
200 


CCT 
Pro 


GAG 
Glu 


AAT 
Asn 


GAG 
Glu 


ATG 
Met 
205 


ATC 
He 


TAC 
Tyr 


AAG 
Lys 


TGG 
Trp 


GAA 
Glu 
210 


AAT 
Asn 


727 




TTC 
Phe 


AAG 
Lys 


CTT 
Leu 


GAA 
Glu 
215 


ATC 
He 


AAT 
Asn 


GAG 
Glu 


AAG 
Lys 


AAC 
Asn 
220 


TCC 
Ser 


TGG 
Trp 


AAG 
Lys 


CTC 
Leu 


TTC 
Phe 
225 


CAG 
Gin 


TTT 
Phe 


775 




GAT 
Asp 


TTT 
Phe 


ACA 
Thr 
230 


GGA 
Gly 


GTG 
Val 


AGC 
Ser 


AAC 
Asn 


AAA 
Lys 
235 


ACT 
Thr 


GAA 
Glu 


ATA 
He 


ATC 
He 


ACA 
Thr 
240 


ACC 
Thr 


CCA 
Pro 


GTT 
Val 


823 




GGT 
Gly 


GAC 
Asp 
245 


TTC 
Phe 


ATG 
Met 


GTC 
Val 


ATG 
Met 


ACG 
Thr 
250 


ATT 
He 


TTC 
Phe 


TTC 
Phe 


AAT 
Asn 


GTG 
Val 
255 


AGC 
Ser 


AGG 
Arg 


CGG 
Arg 


TTT 
Phe 


871 




GGC 
Gly 
260 


TAT 
Tyr 


GTT 
Val 


GCC 
Ala 


TTT 
Phe 


CAA 
Gin 
265 


AAC 
Asn 


TAT 
Tyr 


GTC 
Val 


CCT 
Pro 


TCT 
Ser 
270 


TCC 
Ser 


GTG 
Val 


ACC 
Thr 


ACG 
Thr 


ATG 
Met 
275 


919 




CTC 
Leu 


TCC 
Ser 


TGG 
Trp 


GTT 
Val 


TCC 
Ser 
280 


TTT 
Phe 


TGG 
Trp 


ATC 
He 


AAG 
Lys 


ACA 
Thr 
285 


GAG 
Glu 


TCT 
Ser 


GCT 
Ala 


CCA 
Pro 


GCC 
Ala 
290 


CGG 
Arg 


967 




ACC 
Thr 


TCT 
Ser 


CTA 
Leu 


GGG 
Gly 


ATC 
He 


ACC 
Thr 


TCT 
Ser 


GTT 
Val 


CTG 
Leu 


ACC 
Thr 


ATG 
Met 


ACC 
Thr 


ACG 
Thr 


TTG 
Leu 


GGC 
Gly 


ACC 
Thr 


1015 



& 
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295 300 305 

TTT TCT CGT AAG AAT TTC CCG CGT GTC TCC TAT ATC ACA GCC TTG GAT 1063 

Phe Ser Arg Lys Asn Phe Pro Arg Val Ser Tyr lie Thr Ala Leu Asp 
310 315 320 

TTC TAT ATC GCC ATC TGC TTC GTC TTC TGC TTC TGC GCT CTG TTG GAG 1111 

Phe Tyr lie Ala lie Cys Phe Val Phe Cys Phe Cys Ala Leu Leu Glu 
325 330 335 

TTT GCT GTG CTC AAC TTC CTG ATC TAC AAC CAG ACA AAA GCC CAT GCT 1159 

Phe Ala Val Leu Asn Phe Leu lie Tyr Asn Gin Thr Lys Ala His Ala 

340 345 350 355 

TCT CCT AAA CTC CGC CAT CCT CGT ATC AAT AGC CGT GCC CAT GCC CGT 1207 

Ser Pro Lys Leu Arg His Pro Arg lie Asn Ser Arg Ala His Ala Arg 
360 365 370 

ACC CGT GCA CGT TCC CGA GCC TGT GCC CGC CAA CAT CAG GAA GCT TTT 1255 

Thr Arg Ala Arg Ser Arg Ala Cys Ala Arg Gin His Gin Glu Ala Phe 
375 380 385 

o 

GTG TGC CAG ATT GTC ACC ACT GAG GGA AGT GAT GGA GAG GAG CGC CCG 1303 

g Val Cys Gin lie Val Thr Thr Glu Gly Ser Asp Gly Glu Glu Arg Pro 
rj 390 395 400 

O TCT TGC TCA GCC CAG CAG CCC CCT AGC CCA GGT AGC CCT GAG GGT CCC 1351 

Uj Ser Cys Ser Ala Gin Gin Pro Pro Ser Pro Gly Ser Pro Glu Gly Pro 
\x\ 405 410 415 

CGC AGC CTC TGC TCC AAG CTG GCC TGC TGT GAG TGG TGC AAG CGT TTT 1399 

Arg Ser Leu Cys Ser Lys Leu Ala Cys Cys Glu Trp Cys Lys Arg Phe 

420 ^25 430 435 

iy 

=fU AAG AAG TAC TTC TGC ATG GTC CCC GAT TGT GAG GGC AGT ACC TGG CAG 14 4 7 

ffj Lys Lys Tyr Phe Cys Met Val Pro Asp Cys Glu Gly Ser Thr Trp Gin 

Tr-\ 440 445 4 50 



CAG GGC CGC CTC TGC ATC CAT GTC TAC CGC CTG GAT AAC TAC TCG AGA 14 95 

Gin Gly Arg Leu Cys lie His Val Tyr Arg Leu Asp Asn Tyr Ser Arg 
455 460 465 

GTT GTT TTC CCA GTG ACT TTC TTC TTC TTC AAT GTG CTC TAC TGG CTT 154 3 

Val Val Phe Pro Val Thr Phe Phe Phe Phe Asn Val Leu Tyr Trp Leu 
470 475 480 

GTT TGC CTT AAC TTG TAG GTACCAGCTG GTACCCTGTG GGGCAACCTC 1591 
Val Cys Leu Asn Leu * 
485 

TCCAGTTCCC CAGGAGGTCC AAGCCCCTTG CCAAGGGAGT TGGGGGAAAG CAGCAGCAGC 1651 

AGCAGGAGCG ACTAGAGTTT TTCCTGCCCC ATTCCCCAAA CAGAAGCTTG CAGAGGGTTT 1711 

GTCTTTGCTG CCCCTCTCCC CTACCTGGCC CATTCACTGA GTCTTCTCAG CAGACCATTT 1771 

CAAATTATTA ATAAATGGGC CACCTCCCTC TTCTTCAAGG AGCATCCGTG ATGCTCAGTG 1831 

TTCAAAACCA CAGCCACTTA GTGATCAGCT CCCTAAAACC ATGCCTAAGT ACAGGCGGAT 1891 

TAGCTATCTT CCAACAATGC TG ACC ACC AG ACAATTACTG CATTTTTCCA GAAGCCCACT 1951 



G 

W 
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ATTGCCTTTG TAGTGCTTTC GGCCCAGTTC TGGCCTCAGC CTCAAAGTGC ACCGACTAGT 2011 

TGCTTGCCTA TACCTGGCAC CTCATTAAGA TGCTGGGCAG CAGTATAACA GGAGGAAGAG 2071 

ATCCCTCTCC TTTGGTCAGA TTATTATGTT CTCAGTTCTC TCTCCCTGCT ACCCCTTTCT 2131 

CTGCAGATAG AT AG AC AC T G GCATTATCCC TTCAGGAAGA GGGGGGGGCA GCAAGAGAGC 2191 

CTATTTGGGA CAGCATTCCT CTCTCTCTGC TGCTGTGACA TCTCCCTCTC CTTGCTGGCT 2251 

CCATCTTTCG TCTGCACTAC CAATTCAATG CCCTTCATCC AATGGGTATC TATTTTTGTG 2311 

TGTGATTATA GTAACTACTC CCTGCTTTAT ATGCCACCCT CTTCCTTCTC TTTGACCCCT 2371 

GTGACTCTTT CTGTAACTTT CCCAGTGACT TCCCCTAGCC CTGACCCAGG CACTAGGCCT 2 4 31 

TGGTGACTTC CTGGGGCCAA GAAACTAAGG AAACTCGGCT TTGCAACAGG CATTACTCGC 24 91 

CATTGATTGG TGCCCACCCA GGGCACACTG TCGGAGTTCT ATCACTTGCT TGACCCCTGG 2551 

ACCCATAAAC CAGTCCACTG TTATACCCGG GGCACTCTAA C CAT C AC AAT CAATCAATCA 2 611 

AATTCCCTTA AATTTGTATG GCACTGGAAC TTTGGCAAAG CACTTTTGAC AAGTTGTGTC 2 671 

TGATTGGAGC T T CAT GAT AG CCTTGTGACA TCTTTAGGGC AGGATTCTTA TCCCCATTTT 2731 

G C AG AT G AAA ACCCTGAGTC AC AG AT T T C T GTGGGACTGT GGATCTCACT GGAAGCTATC 27 91 

hj CAAGAGCCCA CTGTCACCTT C TAG AC C AC A TGATAGGGCT AG AC AG C T C A GTTCACCATG 28 51 

ru 

ATTCTCTTCT GTCACCTCTG CTGGCACACC AGTGGCAAGG CCCAGAATGG CGACCTCTCT 2 911 

feJ TTAGCTCAAT TTCTGGGCCT GAGGTGCTCA GACTGCCCCC AAGATCAAAT CTCTCCTGGC 2 971 

ru 

HJ TGTAGTAACC CAGTGGAATG AAT T T G G AC A TGCCCCAATG CTTCTATATG CTAAGTGAAA 3031 

TCTGTGTCTG TAATTTGTTG GGGGGTGGAT AGGGTGGGGT CTCCATCTAC TTTTTGTCAC 3091 

^ CATCATCTGA AATGGGGAAA TAT G T AAAT A AAT AT AT C AG CAAAGCAAAA AGAAAAAAAA 3151 

AAA 3154 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 506 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Met Leu Ser Lys Val Leu Pro Val Leu Leu Gly lie Leu Leu lie Leu 

-18 -15 -10 -5 

Gin Ser Arg Val Glu Gly Pro Gin Thr Glu Ser Lys Asn Glu Ala Ser 

15 10 

Ser Arg Asp Val Val Tyr Gly Pro Gin Pro Gin Pro Leu Glu Asn Gin 

15 20 25 30 



Leu Leu Ser Glu 



Val Gly Lys Leu 
50 

Asn Tyr Asp His 
65 

Val Thr Val Glu 
80 

Asp Met Glu Tyr 
95 

Glu Arg Leu Cys 



Asn Val Val Ser 
130 

Lys Arg Thr His 
145 

lie Tyr Lys Asp 
160 

Ala Gly Cys Ser 
175 

Cys Pro Leu Ser 



Tyr Lys Trp Glu 
210 

Lys Leu Phe Gin 
225 

lie Thr Thr Pro 
240 

Val Ser Arg Arg 
255 

Ser Val Thr Thr 



Ser Ala Pro Ala 
290 

Thr Thr Leu Gly 
305 

lie Thr Ala Leu 
320 

Cys Ala Leu Leu 
335 

Thr Lys Ala His 



Glu Thr Lys Ser 
35 

Pro Glu Ala Ser 



Lys Leu Arg Pro 
70 

lie Ala Val Asn 
85 

Thr lie Asp lie 
100 

Tyr Asn Asp Thr 
115 

Gin Leu Trp lie 



Glu His Glu lie 
150 

Gly Lys Val Leu 
165 

Leu His Met Leu 
180 

Phe Ser Ser Phe 
195 

Asn Phe Lys Leu 



Phe Asp Phe Thr 
230 

Val Gly Asp Phe 
245 

Phe Gly Tyr Val 
260 

Met Leu Ser Trp 
275 

Arg Thr Ser Leu 



Thr Phe Ser Arg 
310 

Asp Phe Tyr lie 
325 

Glu Phe Ala Val 
340 

Ala Ser Pro Lys 
355 
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Thr Glu Thr Glu 
40 

Arg lie Leu Asn 
55 

Gly lie Gly Glu 



Ser Leu Gly Pro 
90 

lie Phe Ser Gin 
105 

Phe Glu Ser Leu 
120 

Pro Asp Thr Phe 
135 

Thr Met Pro Asn 



Tyr Thr lie Arg 
170 

Arg Phe Pro Met 
185 

Ser Tyr Pro Glu 
200 

Glu lie Asn Glu 
215 

Gly Val Ser Asn 



Met Val Met Thr 
250 

Ala Phe Gin Asn 
265 

Val Ser Phe Trp 
280 

Gly lie Thr Ser 
295 

Lys Asn Phe Pro 



Ala lie Cys Phe 
330 

Leu Asn Phe Leu 
345 

Leu Arg His Pro 
360 



Thr Gly Ser Arg 
45 

Thr lie Leu Ser 
60 

Lys Pro Thr Val 
75 

Leu Ser lie Leu 



Thr Trp Tyr Asp 
110 

Val Leu Asn Gly 

,125 

Phe Arg Asn Ser 
140 

Gin Met Val Arg 
155 

Met Thr lie Asp 



Asp Ser His Ser 
190 

Asn Glu Met lie 
205 

Lys Asn Ser Trp 
220 



Lys Thr Glu lie 
235 

lie Phe Phe Asn 



Tyr Val Pro Ser 
270 

lie Lys Thr Glu 
285 

Val Leu Thr Met 
300 

Arg Val Ser Tyr 
315 

Val Phe Cys Phe 



lie Tyr Asn Gin 
350 

Arg lie Asn Ser 
365 
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Arg Ala His Ala Arg Thr Arg Ala Arg Ser Arg Ala Cys Ala Arg Gin 
370 375 380 

His Gin Glu Ala Phe Val Cys Gin He Val Thr Thr Glu Gly Ser Asp 
385 390 395 

Gly Glu Glu Arg Pro Ser Cys Ser Ala Gin Gin Pro Pro Ser Pro Gly 
400 405 410 

Ser Pro Glu Gly Pro Arg Ser Leu Cys Ser Lys Leu Ala Cys Cys Glu 
415 420 425 430 

Trp Cys Lys Arg Phe Lys Lys Tyr Phe Cys Met Val Pro Asp Cys Glu 
435 440 445 

Gly Ser Thr Trp Gin Gin Gly Arg Leu Cys He His Val Tyr Arg Leu 
450 455 460 

Asp Asn Tyr Ser Arg Val Val Phe Pro Val Thr Phe Phe Phe Phe Asn 
465 470 475 

Val Leu Tyr Trp Leu Val Cys Leu Asn Leu 
480 485 

(2) INFORMATION FOR SEQ ID NO : 4 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 303 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
TAGGTACCAG CTGGTACCCT GTGGGGCAAC CTCTCCAGTT CCCCAGGAGG TCCAAGCCCC 
TTGCCAAGGG AGTTGGGGGA AAGCAGCAGC AGCAGCAGGA GCGACTAGAG TTTTTCCTGC 
CCCATTCCCC AAACAGAAGC TTGCAGAGGG TTTGTTTTTG CTGCCCCTCT CCCCTACCTG 
GCCCATTCAC TGAGTCTTCT CAGCAGACCA TTTCAAATTA TTAATAAATG GGCCACCTCC 
CTCTTCTTCA AGGAGCATCC GTGATGCTCA GTGTTCAAAA CCACAGCCAC TTAGTGATCA 
GCT 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 206 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
GCTTTGCTGA TATATTTATT TACATATTTC CCCATTTCAG ATGATGGTGA CAAAAAGTAG 
ATGGAGACCC CACCCTATCC ACCCCCCAAC AAAT T AC AG A CACAGATTTC ACTTAGCATA 
TAGAAGCATT GGGGCATGTC CAAATTCATT CCACTGGGTT ACTACAGCCA GGAGAGATTT 
GATCTTGGGG GCAGTCTGGA GCACCT 

(2) INFORMATION FOR SEQ ID NO : 4 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3974 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
GGTACCTAAG TGAGTAGGGC GTCCGATCGA CGGACGCCTT TTTTTTGAAT TCGTAATCAT 
GGTCATAGCT GTTTCCTGTG TGAAATTGTT ATCCGCTCAC AATTCCACAC AACATACGAG 
CCGGAAGCAT AAAGTGTAAA GCCTGGGGTG CCTAATGAGT GAGCTAACTC ACATTAATTG 
CGTTGCGCTC ACTGCCCGCT TTCCAGTCGG GAAACCTGTC GTGCCAGCTG CATTAATGAA 
TCGGCCAACG CGCGGGGAGA GGCGGTTTGC GTATTGGGCG CTCTTCCGCT TCCTCGCTCA 
CTGACTCGCT GCGCTCGGTC GTTCGGCTGC GGCGAGCGGT ATCAGCTCAC TCAAAGGCGG 
TAATACGGTT AT C C AC AG AA TCAGGGGATA ACGCAGGAAA GAACATGTGA GCAAAAGGCC 
AGCAAAAGGC CAGGAACCGT AAAAAGGCCG CGTTGCTGGC GTTTTTCCAT AGGCTCCGCC 
CCCCTGACGA G CAT C AC AAA AATCGACGCT CAAGTCAGAG GTGGCGAAAC CCGACAGGAC 
TATAAAGATA CCAGGCGTTT CCCCCTGGAA GCTCCCTCGT GCGCTCTCCT GTTCCGACCC 
TGCCGCTTAC CGGATACCTG TCCGCCTTTC TCCCTTCGGG AAGCGTGGCG CTTTCTCATA 
GCTCACGCTG TAGGTATCTC AGTTCGGTGT AGGTCGTTCG CTCCAAGCTG GGCTGTGTGC 
ACGAACCCCC CGTTCAGCCC GACCGCTGCG CCTTATCCGG TAACTATCGT CTTGAGTCCA 
ACCCGGTAAG ACACGACTTA TCGCCACTGG CAGCAGCCAC TGGTAACAGG ATTAGCAGAG 
CGAGGTATGT AGGCGGTGCT ACAGAGTTCT TGAAGTGGTG GCCTAACTAC GGCTACACTA 
GAAGAACAGT ATTTGGTATC TGCGCTCTGC TGAAGCCAGT TACCTTCGGA AAAAGAGTTG 
GTAGCTCTTG ATCCGGCAAA CAAACCACCG CTGGTAGCGG TGGTTTTTTT GTTTGCAAGC 
AGCAGATTAC GCGCAGAAAA AAAGGATCTC AAGAAGATCC TTTGATCTTT TCTACGGGGT 



# 
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CTGACGCTCA GTGGAACGAA AACTCACGTT AAGGGATTTT GGTCATGAGA TTATCGTCGA 114 0 

CAATTCGCGC GCGAAGGCGA AGCGGCATGC ATTTACGTTG AC AC CAT C G A ATGGTGCAAA 1200 

ACCTTTCGCG GTATGGCATG ATAGCGCCCG GAAGAGAGTC AATTCAGGGT GGTGAATGTG 12 60 

AAACCAGTAA CGTTATACGA TGTCGCAGAG TATGCCGGTG TCTCTTATCA GACCGTTTCC 1320 

CGCGTGGTGA ACCAGGCCAG CCACGTTTCT GCGAAAACGC GGGAAAAAGT GGAAGCGGCG 1380 

ATGGCGGAGC TGAATTACAT TCCCAACCGC GTGGCACAAC AACTGGCGGG CAAACAGTCG 14 4 0 

TTGCTGATTG GCGTTGCCAC CTCCAGTCTG GCCCTGCACG CGCCGTCGCA AATTGTCGCG 1500 

GCGATTAAAT CTCGCGCCGA TCAACTGGGT GCCAGCGTGG TGGTGTCGAT GGTAGAACGA 1560 

AGCGGCGTCG AAGCCTGTAA AGCGGCGGTG CACAATCTTC TCGCGCAACG CGTCAGTGGG 1620 

CTGATCATTA ACTATCCGCT GGATGACCAG GATGCCATTG CTGTGGAAGC TGCCTGCACT 168 0 

AATGTTCCGG CGTTATTTCT TGATGTCTCT GACCAGACAC CCATCAACAG TATTATTTTC 17 4 0 

TCCCATGAAG ACGGTACGCG ACTGGGCGTG GAGCATCTGG TCGCATTGGG TCACCAGCAA 18 00 

ATCGCGCTGT TAGCGGGCCC ATTAAGTTCT GTCTCGGCGC GTCTGCGTCT GGCTGGCTGG 18 60 

CATAAATATC TCACTCGCAA TCAAATTCAG CCGATAGCGG AACGGGAAGG CGACTGGAGT 1920 

GCCATGTCCG GTTTTCAACA AACCATGCAA ATGCTGAATG AGGGCATCGT TCCCACTGCG 1980 

ATGCTGGTTG CCAACGATCA GATGGCGCTG GGCGCAATGC GCGCCATTAC CGAGTCCGGG 204 0 

CTGCGCGTTG GTGCGGATAT CTCGGTAGTG GGATACGACG ATACCGAAGA CAGCTCATGT 2100 



FU TATATCCCGC CGTTAACCAC CAT C AAAC AG GATTTTCGCC TGCTGGGGCA AACCAGCGTG 2160 

,r\ GACCGCTTGC TGCAACTCTC TCAGGGCCAG GCGGTGAAGG GCAATCAGCT GTTGCCCGTC 2220 

**- TCACTGGTGA AAAGAAAAAC CACCCTGGCG CCCAATACGC AAACCGCCTC TCCCCGCGCG 228 0 

TTGGCCGATT CATTAATGCA GCTGGCACGA CAGGTTTCCC GACTGGAAAG CGGGCAGTGA 2 34 0 

GCGCAACGCA ATTAATGTAA GTTAGCGCGA ATTGTCGACC AAAGCGGCCA TCGTGCCTCC 2 4 00 

CCACTCCTGC AGTTCGGGGG CATGGATGCG CGGATAGCCG CTGCTGGTTT CCTGGATGCC 24 60 

GACGGATTTG CACTGCCGGT AGAACTCCGC GAGGTCGTCC AGCCTCAGGC AGCAGCTGAA 2520 

CCAACTCGCG AGGGGATCGA GCCCGGGGTG GGCGAAGAAC TCCAGCATGA GATCCCCGCG 258 0 

CTGGAGGATC ATCCAGCCGG CGTCCCGGAA AACGATTCCG AAGCCCAACC TTTCATAGAA 2 64 0 

GGCGGCGGTG GAATCGAAAT CTCGTGATGG CAGGTTGGGC GTCGCTTGGT CGGTCATTTC 27 00 

GAACCCCAGA GTCCCGCTCA GAAGAACTCG TCAAGAAGGC GATAGAAGGC GATGCGCTGC 27 60 

GAATCGGGAG CGGCGATACC GTAAAGCACG AGGAAGCGGT CAGCCCATTC GCCGCCAAGC 2820 

TCTTCAGCAA TATCACGGGT AGCCAACGCT ATGTCCTGAT AGCGGTCCGC CACACCCAGC 2880 

CGGCCACAGT CGATGAATCC AGAAAAGCGG CCATTTTCCA CCATGATATT C GGCAAGC AG 294 0 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
AAGCTTAAAA AACTGCAAAA AATAGTTTGA CTTGTGAGCG GATAACAATT AAGATGTACC 
CAATTGTGAG CGGATAACAA TTTCACACAT TAAAGAGGAG AAAT T AC AT A TG 



60 
112 



